Return times of random walk on generalized random graphs 
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Random walks are used for modeling various dynamics in, for example, physical, biological, and 
social contexts. Furthermore, their characteristics provide us with useful information on the phase 
transition and critical phenomena of even broader classes of related stochastic models. Abundant 
results are obtained for random walk on simple graphs such as the regular lattices and the Cayley 
trees. However, random walks and related processes on more complex networks, which are often 
more relevant in the real world, are still open issues, possibly yielding different characteristics. In 
this paper, we investigate the return times of random walks on random graphs with arbitrary vertex 
degree distributions. We analytically derive the distributions of the return times. The results are 
applied to some types of networks and compared with numerical data. 

PACS numbers: 02.50.Ga, 05.40.Fb, 89.75.-k 



I. INTRODUCTION 

The theory of the random walk has a long history. Random walks and their extensions have also 
been applied with profound theoretical bases to modeling numerous types of physical, biological, 
sociological, and economical dynamics pj. For example, distributions of return times and scaled limit 
distributions of the walkers' positions are broadly known for simple underlying graphs. They provide 
useful information on critical values and phase transitions in regard to survival of the branching 
random walks and the contact processes S- 01 ■ survival in the voter models 0, Q , and occurrence 
of percolation [fj , 

Indeed, a large body of theoretical results are available for random walks performed on regular lat- 
tices such as Z and on the Cayley (or regular) trees, which are denned to be trees with homogeneous 
vertex degree. However, it has been suggested recently that more complex networks as opposed to 
regular graphs and conventional random graphs Q are concerned to real worlds. Particularly, impor- 
tant classes of random graphs such as small- world networks and scale- free networks were proposed and 
have been examined in the last several years. These networks share some important properties with 
real networks, such as the clusterin g p roperty, short average path length, and the power-law of the 
vertex degree distributions |8t Ifl Ulfilllj . They have been applied to the analysis of various biological, 
eng ineering, and social networks including information flow in the Internet H El EI and epidemics 
IllL IT^ . The properties of spatial stochastic models, both static configurations and dynamical pro- 
cesses, typically change as the network topology varies even when other basic quantities such as the 
mean vertex degree is conserved. For example, the analysis of percolation-based models revealed that 
the critical parameter values for the occurrence of global epidemics, or even their existence, depend 
on network topology 0, El El • 

It is highly likely that the properties of random walks depend on network topology 0,Q , as numerical 
and approximate results suggest for the quenched El and annealed Q Watts- Strogatz- type small- 
world networks and for quenched random graphs with homogeneous vertex degree 15] . In relation to 
this issue, how eigenvalues of the adjacency matrices are distributed has been numerically examined 
for scale- free and small- world networks |1(t| . The largest eigenvalue of an adjacency matrix measures 
how the number of closed paths increases as the path length tends to infinity. The eigenvalues supply 
useful information on the return times of random walks 0, serving to a wide range of applications 
as mentioned above. However, the largest eigenvalue p has been characterized only in terms of the 
numerical scaling law for the scale-free networks in an unnormalized manner, namely, p oc m 1 / 2 ^ 1 / 4 , 
5_i ■ where N is the system size and 2m is the mean vertex degree El ■ 

In this paper, we analyze random walks on a general class of random graphs that includes random 
scale- free networks, the Erdos-Renyi random graph, and the Cayley trees as special cases pi Ifjl ITol ITT| . 
Explicit expressions for the first return time probability and the annealed approximation forms for 
the general return time probability are derived with the use of partition of integers. In Sec. [H] wc 
introduce the network model and the generating functions. In Sec. II I II we calculate the probability 



o 
o 



distribution functions of the return time of random walk. Then, in Sec. IIVI we confirm with some 
examples that our theoretical estimates match numerical results. Lastly, the conclusion follows, and 
the difference in the decay rate of the return time probability between regular and random networks, 
which implies the difference in the possibility of percolation and the survival of contact processes, are 
also touched upon. 



II. NETWORK MODEL AND GENERATING FUNCTIONS 

We analyze a class of random graphs called generalized random graphs in physical contexts 
or Galton- Watson trees in mathematical contexts These random graphs are infinite trees without 

loops. The degree of each vertex, or the number of neighbors, is distributed according to an identical 
and independent probability density function. As shown in Fig. ^ each realization of the graph, 
which is generally inhomogeneous, is taken from the random ensemble. However, they are regular in 
a statistical sense. Let us denote by pk the probability that a vertex has the degree equal to k. We 
assume that po = without losing generality. Consequently, Y^kLiPk = 1- 

Let us designate an arbitrary vertex O of a realized graph as the root. We examine a random 
walk starting from O. Since we exclusively deal with trees here, the random walker can return to 
O only when the time n € {0,1,2,...} is even. In accordance, we denote by q n the probability 
that the random walker returns to O for the first time at time 2n, and by r n the probability that it 
returns to O irrespective of the accumulated number of returns. Here we consider only the annealed 
random walk, confining ourselves in the analysis of return times averaged over both probability space 
of graph and that of random walk. To be contrasted with the annealed randomness is the quenched 
randomness, which is concerned to the ensemble of walkers on a fixed realization of random graph [l7| . 
Both quenched [l3L ITEj l and annealed [l4| random walks have been implicitly treated in the studies of 
random dynamics on complex networks. Though quenched environments are realistic, the statistics 
based on annealed walks that we derive in the following can be regarded as averages of the statistics 
of quenched walks over the ensemble of a random graph. 

The generating functions for the distributions {q n } and {r n }, which we respectively denote by Q(z) 
and R{z) are defined by 

oo oo 

Q(z) = ii(z) = £V„z". (!) 

71=0 n=0 

With go = and rg = 1, Q{z) and R(z) satisfy the following relation: 

oc / n \ 

R(z) = ^ I ^ q m r n -m + d n ,o z n 

n— / 

= R(z)Q(z) + l, (2) 

where Sij = 1 for i = j and Sij = otherwise 0,0]. Strictly speaking, Eq. (J2J) is valid only for the 
quenched case. Therefore, the following results for R{z) should be understood as an approximation 
by annealed statistics. 



III. DISTRIBUTIONS OF RETURN TIMES 



To derive explicit expressions for the return time distributions, we provide the approximate recursion 
relation below. Let us resort to Fig.^for explanation. Suppose that the random walker starting from 
O returns to O after 2n steps for the first time (n = 7 in Fig. In the first step, the random walker 
moves to a neighbor that we denote by O 1 . Because of the statistical homogeneity of the generalized 
random graph, the vertex degree of O' is distributed as specified by {pk} whichever neighbor of O is 
chosen. The random walker has to arrive at O' at time 2n — 1 (= 13) and move to O subsequently 
at time 2n (= 14). The last event occurs with probability 1/k. In the meantime, the random walker 
travels for 2n ~ 2 steps without visiting O. The walker wanders in the subtrees rooted at O 1 to 
complete loops, or closed paths of random walk. Any such loop cannot contain O, and the probability 
that a path emanating from O' enters a subtree is (fc — l)/fc. Let us denote by a the number of the 



loops originating from O' . In Fig. ^ a is equal to 2. Then, the length 2nj (1 < i < a) of each loop 
is even (ni = 5 and ri2 = 1 in Fig. and 2rii must sum up to 2n — 2. In addition, since the vertex 
degree is homogeneously distributed, the probability law for the length of loop is assumed to be the 
same as that for the original random walk starting and ending at O. 

Here we make a crucial approximation of disregarding any memory effects. In other words, we 
suppose that the a subtrees rooted at O' are independent of each other. In fact, if the same neighbor 
of O' is chosen for different entries into the subtree, the subtrees reached by these different entries 
coincide. As an example, the random walker shown in Fig. ^ travels from A to the subtree rooted at 
B twice, before returning to O. In this occasion, it is not qualified to regard the vertex degrees and 
the loop lengths to be independent for the two neighbors of O' . However, the approximation error 
is small unless the mean vertex degree is extremely small. The accuracy of the following analytical 
methods are investigated in comparison with numerical simulations in Sec. IIVI 

Based on the consideration above, we have the following recursion formula: 

E x ^-4 x - k — 1 fc — 1 k — 1 1 

Pk ^ 2^ ~k~ qni ~k~ qn2 ■ ■ ■ ~k~ Qna k 

k=l o=0 Y^ a _ n*=n-l,n 4 >0,l<i<a 

fc=l a=0 n i=n-l,n i >Q,l<i<a a ' = 1 

which covers the singular case qo = as well. Using Eq. 0, the generating function of q n is calculated 

as 

^) = E^EtE(^T E n^M- 

^ Q _ i n i =n-X,n i >Q,l<i<a a ' = l 
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71=0 Y^ a - ni=n-l,m>0,l<i<a a '^ 1 
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^ k - (k - l)Q(z) 
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Although Q(l) = 1 is always consistent with Eq. we exclude this case because the random walk on 
generalized random graphs including the Cayley trees is transient @ , except for the Cayley tree with 
vertex degree 2, which is identical to Z. Accordingly, we look for the solution satisfying Q(l) < 1. 
By expanding the right-hand side of Eq. Q , we obtain 
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(5) 



where 



M{z) = J2 m n z n (6) 



n=l 

is the generating function of the moment function given by 



E ~ iuTT Pk- (7) 

k=l 



In deriving Eq. J3J), the expansion is justified by the fact that Q(z) has the radius of convergence equal 
to 1 and that (k — l)/k < 1. Then, we apply the following theorem to calculate Q(z) and R(z). 

Lagrange's inversion formula |18| Let z = w/f(w) where w/f(w) is an analytic function of w near 
w = 0. If 5 is infinitely diffcrcntiable, then 
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r [g'(u)f(uY 

du n-i ^ y V ' 



(8) 



u=0 



For our purpose, we set w[z) = Q{z), f(w) = M(w)/w, g{w) = w in Eq. (|HJ. Apparently, the fact 
that mi > guarantees the regularity of w/ f(w) around w = 0. As a result, we have 



Q{z) = g{w) 



n ( d n-l 



n\ 1 du 



fM{u) 
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We also note that 



M(u) n = (m 1 u + m 2 u 2 + •••)* 
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where Eah?i' indicates the summation over all the partitions of the integer n into integers. In general, 
a partition A is represented by A = (l lA ( :L )2* x ^ • • •), which means that 1 is included i\(l) times in 
A, 2 is included i\(2) times, and so on 0. By the definition of partition, {i\(l),i\(2), . . .} (Ah n') 
satisfies 



"£lh(l) = 52lix(l) = n'. (11) 
i=i i=i 

For example, 

{A | A h 5} = {(l 5 ), (1 3 2), (1 2 3), (12 2 ), (14), (23), (5)} , (12) 
{A | A h 7} - {(l 7 ), (1 5 2), (1 4 3), (1 3 2 2 ), (1 3 4), (1 2 23), (1 2 5), (12 3 ), (124), (13 2 ), 

(16),(2 2 3),(25),(34),(7)}. (13) 

In Eq. I|10|l . only the partitions whose numbers of parts are n are concerned. Corresponding to 
Eqs. Ijl2|l and Ijl3|l . the partitions appearing in the summation of Eq. IjlOjl for (n,n') — (3,5) and 
(4, 7) are as follows: 



Ah5,][>(0 = 3 = {(1 2 3),(12 2 )}, 
i=i ) 

oo ^ 

Ah7,J>(0=4 = {(1 3 4),(1 2 23),(12 3 )} 



(14) 
(15) 
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With Eq. iJTUjl. Eq. © is evaluated as follows 
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where 



Accordingly, 



\\-2n-l,J2Zi i ^= n 1=1 



OO 



(18) 



when n > 1, and 90 = 0. 

What is necessary for deriving R(z) is just to replace g(w) = w with g(w) = 1/(1 — u>) when 
applying Eq. (J8J. Using Eq. I|10(l . we obtain the annealed approximation form of R(z): 
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which results in 
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when n > 1, and ro = 1. 
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IV. EXAMPLES 



The analytical methods developed in Sec. IIHI can be broadly applied since the only assumptions 
that we have made on {pk} are po = and that the average vertex degree is not so small. In this 
section, we apply our theoretical estimates to random walk on some classes of graphs that are often 
relevant in real-world situations and also of theoretical interest. 



A. Cayley trees 

Let us first consider the Cayley trees ^(J in which each vertex has exactly d vertices. Substituting 
Pk = h,d into Eq. (@J yields 

«jw - jrjjhmr (21) 

Although Eq. I|21() has two different solutions of Q(z), the one satisfying Q(l) = 1 is excluded because 
of the transient nature of the random walk on the Cayley trees P, 0, • Then Eq. Ij21|l is led to 



(22) 



In this case, Q(z) is related to the generating function S(z) of Catalan numbers D n = 2nC n /{n + 1) 
fijl as follows: 



1- A /T~4l d-i ( <P \ 

S{Z) = 2z = — Q \J—1 Z ) ■ (23) 

Accordingly, we obtain 

V« = d 2nU (24) 

On the other hand, applying mi = (d— l) 1 ^ 1 /d l to Eq. 1)18(1 results in 



E 



= E (25) 

Combining Eqs. ((2411 and H25|) provides a useful by-product: 

E 4 n) =^n-l, (26) 

which states that the sum of the coefficients in the moment expansion of q n [see Eq. ()18|l] is always 
equal to D n -\ without regard to the distribution {pk}- 
Similarly, Eq. 1(19(1 becomes 



*M = (27, 



Then, it follows that 



and 



1-- — l z ) R [j — -z)=l-dzS(z) (28) 



^ = i-E(^r-)^- (M) 

n'=0 V / 

Owing to the entire homogeneity of the Cayley trees, Eqs. (122(1 and ((27(1 are exact in this case and agree 
with the theoretical results obtained by identifying random walk on the Cayley trees with unbiased 
random walk on Z . 



B. Erdos-Renyi random graph 



The Erdos-Renyi (ER) random graph is generated by independently assigning an edge with prob- 
ability p between any possible pairs of vertices 0, 1HJ LUl • If the number of vertices N scales so that 
A = Np converges in the limit N — > oo, the vertex degree is distributed as specified by the Poisson 
distribution, namely, 

Pk = -£j-c-\ (30) 

Numerically calculated distributions of the first return time are indicated by circles in Figs. Efa) 
and [21 b) for A = 7 and A = 10, respectively. The return probability decreases exponentially in n 



analogous to the case of the Cayley trees indicated by solid lines in Fig. |U 0, 0, 0|. Then many 
sample points are required for reliable estimation of the return time probability, for which reason we 
construct the probability distributions based on 5 x 10 7 runs. A new random graph is created in each 
run. 

Distributions predicted by the theory in Sec. lIIII are indicated by crosses in Fig. [21 The theoretical 
estimates agree with the numerical results better when A = 10. This is because our method works 
better for networks with a larger mean vertex degree, which is equal to A. However, the error is 
bearable in both cases for sufficiently small n for which the numerical distributions are calculated 
based on enough sample points. In other words, the minimum positive probability obtained by the 
simulations is 1/(5 x 10 7 ) = 2 x 10~ 8 , and the numerically estimated probabilities are not reliable 
around this value where statistical fluctuation counts. Related to this remark, Fig. shows that the 
numerical results are actually available just up to small values of n, that is, n < 17 for A = 7 and 
ii < 12 for A = 10. As noted before, this is due to the exponential decay in the return time distribution. 
Furthermore, the decay is faster for a larger mean vertex degree, or a larger A, which more severely 
constrains the practical upper limit of n for which the distribution is obtained. Compared with the 
cumbersome brute-force method, our method needs only calculation of partition of integers, which are 
much more numerically feasible. 

Figure [3 also shows, both for A = 7 and A = 10, that the decay of the first return time probability 
is slower for the ER random graphs than for the Cayley trees with the same mean vertex degree. This 
is presumably because of the dispersion of vertex degree in the ER random graph, as we discuss in 
Sec. E| 



C. Scale-free networks 



The vertex degrees of real networks often have power-law distributions. Barabasi and co-workers 
presented a network growth model with preferential attachment to generate such a graph 0, 
In their scale-free networks, the vertex degree has a lower cutoff to, and the degree distribution is 
represented by pk = Afk~ 3 (k > m) and pk — (fc < m), where J\f is the normalization constant. 
The first return time probabilities of random walk on scale-free random graphs with to = 4 are 
shown in Fig. |2| suggesting that the theory (crosses) again predicts the numerical results (circles) 
in a satisfactory manner. In this case, the mean vertex degree is numerically calculated to be 7.09. 
Accordingly, the results for the Cayley trees with d = 7 (solid lines) and d = 8 (dotted lines) are also 
shown in Fig.|3|for comparison. The probability of the first return time decays slower for the scale-free 
networks, as has also been the case for the ER random graphs. Moreover, comparison of Figs. |21 and 
13 reveals that the discrepancy from the regular case, which is probably caused by the heterogeneous 
vertex degree, is larger for the scale-free networks. This is presumably because the vertex degree is 
more heterogeneous in the scale-free networks than in the ER random graphs. 

Random walk on other related graphs, such as ones whose degree distributions have power laws 
without the lower cutoff, power laws with exponential higher cutoff, or simple exponential decay 
0, , can be analyzed similarly. The only caveat is that the theory is likely to fail when the 
vertex degree is fairly small on average. Let us also mention that there is little hope for obtaining 
more tractable analytical expressions for Q(z) and R(z) even in simpler scale-free cases, because the 
polylogarithm functions, which can be estimated only numerically [llj. appear in the calculation of 

TO/. 



V. CONCLUSIONS 



In this paper, we have derived for generalized random networks the analytic expressions for the 
probability distributions of first and general return times. Our methods correctly predict the numerical 
results as far as the mean vertex degree is not extremely small. They are also useful in saving the 
computation time and hence obtaining return time probabilities on a much longer time scale than 
with straightforward simulations. This merit stems from the fact that the algorithm for calculating 
partition of integers is easily implemented fl9| , whereas brute- force methods require billions of runs to 
obtain the distributions and the asymptotics, particularly in the case of exponentially decaying tails. 

We have also found that heterogeneous graphs such as the ER random graphs and the scale-free 
networks yield slower decay of return time probabilities than the Cayley trees with the corresponding 



vertex degrees. The decay rate is closely linked to critical phenomena and phase transitions of both 
static and dynamical particle systems. In social contexts, information and diseases are 

actually suggested to propagate in a manner different from as we imagine by the analogy of regular 
graphs such as the Cayley trees and regular lattices. For example, percolation is more likely to occur 
in networks with heterogeneous vertex degrees Also for dynamical processes such as contact 

processes and voter models , occurrence of global orders such as epidemics or unanimity 
has the same tendency. Mathematically, the problem of the global orders emerging in these dynamics 
can be associated with that of the dual or related processes. For example, if simple and branching 
random walks (resp. coalescing random walks) are more likely to return to the origin, the critical 
value for phase transition becomes smaller, and the probability of a global epidemics or unanimity 
becomes larger in contact processes 00] (resp. voter models 001). Accordingly, the asymptotic 
behavior of random walk reported in Sec. lIVI suggests that global orders are more likely consequences in 
networks with heterogeneous vertex degrees such as scale- free and ER random networks. This evidence 
substrates the results for the contact processes in epidemic contexts I 111 and poses a dynamical version 
of the exact results on percolation . 

As for exact asymptotic behavior, questions about the Cayley trees with vertex degree d is translated 
into ones about the unbalanced random walk on Z, the analysis of which easily resulting in r„ cx 
n -3/2 — 1/d) 0|. To illuminate the asymptotic behavior of q n and r„ in the case of generalized 
random walks is an important subject of future work. 
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Figure captions 

Figure 1: Schematic diagram showing random walk on a realization of generalized random graph. 
Integers denotes the time of random walk. 

Figure 2: Probability distributions of the first return times of the random walk on the ER random 
graph with (a) A = 7 and (b) A = 10. Numerical and theoretical results are indicated by circles and 
crosses, respectively. The results for the Cayley trees with the same mean vertex degrees, namely, (a) 
d = 7 and (b) d = 10, are indicated by solid lines. 

Figure 3: Probability distributions of the first return times in the case of scale-free networks with 
m = 4. Numerical and theoretical results are indicated by circles and crosses, respectively. The results 
for the Cayley trees with d = 7 (solid lines) and d = 8 (dashed lines) are also shown. 
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